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O . Abstract 

j^ ■ Peer-to-Peer (P2P) systems have proved to be the most effective and popular file sharing applications in 
recent years. Previous studies mainly focus on the equal service and the differentiated service strategies 
when peers have no initial data before their download. In an upload-constrained P2P file sharing system, 
we model both the equal service process and the differentiated service process when peers' initial data 
distribution satisfies some special conditions, and also show how to minimize the time to get the file 
to any number of peers. The proposed models can reveal the intrinsic relations among the initial data 
amount, the size of peer set and the minimum last finish time. By using the models, we can also provide 
c/j ■ arbitrary degree of differentiated service to a certain number of peers. We believe that our analysis 
^, 1 process and achieved theoretical results could provide fundamental insights into studies on bandwidth 
allocation and data scheduling, and can give helpful reference both for improving system performance 
and building effective incentive mechanism in P2P file sharing systems. 
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1 Introduction 

Peer-to-Peer (P2P) systems have proved to be the most effective and popular file sharing appli- 
cation in recent years. Bit Torrent (BT) [1] is the most popular P2P application due to its high 
degree of scalability and effective incentive mechanism to reduce free-riding. 

Recently, most of studies on P2P systems are focused on social network, incentive mechanism 
[2] [3], piece selection algorithm [4] (especially directed at media streaming) etc. However, the 
bandwidth allocation and data scheduling for differentiated service in application layer has been 
paid little attention. We consider this problem to be one of the most fundamental and important 
problems because it determines how the data and bandwidth resource is allocated among peers 
and plays a decisive role in respect of improving the performance of P2P systems. In addition, 
theoretical analysis for differentiated service can also give some insight into the design of incentive 
mechanism. 
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There have been a lot of analytical studies focused on the overlay topology, service capacity 
and piece scheduling etc., when peers are homogeneous with the same download and upload 
bandwidth. Yang et al. [5] [6] propose a branching process for transient state and a Markov chain 
model for steady state of P2P systems. Qiu et al. [7] present a simple fluid model in Bit Torrent- 
like system for the first time, and they evaluate the scalability, performance and efficiency of such 
kind of P2P systems. However, this model only focuses on the peer arrival rate, departure rate 
in steady state, and makes an unrealistic assumption that all peers have homogeneous upload 
bandwidth. Based on the fluid model in [7], Clevenot-Perronnin et al. [8] propose a two-set 
fluid model for service differentiation in BitTorrent-like content distribution systems, whereas 
they assume peers in the same class have the same bandwidth. Biersack et al. [9] propose three 
typical analytical models, i.e. linear model, k-tree model, /c-Ptree model, to address the problem 
that how long it should take to distribute a file to A^ peers. Results indicate that the service 
capacity of most P2P systems grows exponentially with the number of pieces in a file. Esposito 
et al. [10] adapt the fiuid model in [7] to transient phase of P2P file disseminating and propose 
a seed scheduling algorithm called proportional fair seed scheduling (PFS). They show that their 
solution can effectively reduce the average download time than the standard Bit Torrent protocol. 

Under a more general condition that peers have heterogeneous bandwidth, some studies inves- 
tigate the system performance (mostly in terms of last finish time or average finish time) and 
fairness (mostly in terms of peers' contributions to the system) to provide fundamental principles 
and criteria for P2P protocol and application system design. Wu et al. [11] propose a centrally 
scheduled file distribution (CSFD) protocol that can provably minimize the last finish time of a 
single source file distribution process. Mundinger et al. [12] [13] characterize the problem that 
how to minimize the last finish time when peers have heterogeneous upload bandwidth and the 
file can only be divided into discrete pieces. They also evaluate the minimum last finish time 
when all peers receive equal service and the file can be divided into infinitely small pieces, and 
show how each peer should schedule its bandwidth and data to achieve the lower bound. Kumar 
et al. [14] investigate file distribution with differentiated service in two sets of peers, with the 
purpose of distributing the file as quickly as possible to all the first-set peers. Also, they de- 
rive explicit expressions for the minimum distribution time. They make an assumption that the 
second-set peers have no initial data before download. Mehyar et al. [15] extend the results in 
[14] and examine different performance criteria (e.g. last finish time, average finish time, etc.) by 
considering how the pieces and the bandwidth resources can be allocated in order to reach the 
theoretical bounds. Fan et al. [16] analyze the delicate tradeoff between system performance and 
system fairness in BitTorrent-like system, and show that the default configuration in Bit Torrent 
puts more emphasis on fairness rather than performance. Chan et al. [17] deduce a theoretical 
bound on the minimum last finish time in terms of discrete cycles. They also evaluate several 
scheduling algorithm suitable for the push-based model, e.g.. Rarest piece first, Most Demanding 
node first and the Maximum-flow algorithm, and simulation results show that the Maximum-flow 
algorithm outperforms all other algorithms. Ezovski et al. [18] show how each peer should use 
its upload bandwidth to minimize the average finish time in a fully-connected network without 
the presence of peer leaving by using the water-filling technique. Wen et al. [19] extend Ezovski 
et al.'s studies by taking peer leaving condition into account. They present the reversed water 
filling algorithm (RWF) in order to minimize the average finish time. However, no theoretical 
lower bound is calculated because of the complexity of the problem. 

This paper is inspired by the seminal studies [12] [14] [15] [18] [19], and can be regarded as an 
extension of [14] [15] [18]. In addition, the analysis methods (e.g. the water-filling technique) in 



[18] [19] have some significant implications for our researcli. To tlie best of our knowledge, there 
is no theoretical analysis towards the file sharing in P2P network where peers with heterogeneous 
upload bandwidth already have some initial data before their download. In [12] [15], the equal 
service model and the two-set differentiated service model (including the Multiplicity Theorem) are 
first proposed, and the system performance is also evaluated in terms of the last finish time, etc. 
In [14], towards the multi-set problem, they consider a two-set system for simplicity, nevertheless, 
how the second-set peers can finish their download has not been further considered. Based on their 
work, we present extended equal service model and differentiated service model when peers' initial 
data distribution satisfies some special conditions {UCP condition), deduce the ip- Multiplicity 
Theorem, and show how to make the best use of all peers' upload bandwidth to minimize the last 
finish time for peers in the superior set. Both our analysis process and achieved theoretical results 
could provide fundamental insights into studies on bandwidth allocation and data scheduling, and 
can give helpful reference both for improving system performance and building effective incentive 
mechanism in P2P file sharing systems. 

The rest of this paper is organized as follows. Section 2 sums up and refines a few fundamental 
results in previous work. Section 3 discusses the concept of multiplicity in detail and presents 
our extended equal service and differentiated service models. Some numerical results are given in 
section 4 and we make the conclusion in section 5. 



2 Preliminary 

2.1 Assumptions and Metrics 

According to the actual application background and the need of the analysis, most of related 
works make the following reasonable assumptions, which will also be adopted in this paper: 

• The file can be divided into infinitesimal pieces. Each peer can simultaneously upload to as 
many peers as needed. Although in real P2P application systems, the maximum concurrent 
uploads is a fixed value. This assumption is consistent with [12] [13] [14] [18] etc. 

• Upload bandwidth rather than download bandwidth is the only bottleneck. This assumption 
holds because in reality users always have asymmetric access bandwidth. This assumption 
is consistent with [14] [15] [18] [19] etc. 

Unlike the P2P live streaming case where emphasis should be given onto the transmission delay 
[20] and the streaming capacity [21] etc., the following two metrics involving the download time 
should be considered in detail, as we assume that a file can only be used after its last bit is 
obtained by a peer. 

• Last finish time: The time interval every peer in a specified peer set obtains the whole file. 

• Bottleneck time: The time interval that the source can distribute its fresh data (the data 
no one else has) to the swarm. 

Obviously, for any set of peers, the last finish time is no less than the bottleneck time. One of 
the most important targets in this paper is to minimize the last finish time. 



2.2 Fundamental Results in Previous Work 

Given an upload-constrained fully connected P2P network, if all of N peers receive the same ser- 
vice {equal service) from a source, the minimum last finish time Tl is given [12] [14] by the following 
equation with symbols modified for consistency, where Cq is the source's upload bandwidth, U is 
the sum of all peers' upload bandwidth, and F is the file size. 

F 

^' = minjCo,^} ^^^ 

If there are two sets of peers, and the first L peers have superior priority {differentiated service), 
the minimum last finish time of peers in the first set is given [14] by the following equation, where 
Cn-l is f he total upload bandwidth of the last N — L peers. 

^^ = \ 7T^ ^^T (2) 

min|Co,^Co + f/-%^)| 
The bottleneck time tb is calculated by its definition as: 

h = ^ (3) 

Furthermore, Mehyar et al. [15] propose the following theorem to calculate at most how many 
peers can finish their download in the bottleneck time. 

Multiplicity Theorem: It is possible to let the first M peers finish their download in the bottle- 
neck time tfe if and only if: 

M N 

j=l i=M+l 

where q is the upload bandwidth of peer i. For convenience of expression, we use Cm and Cn-m 
to denote the total upload bandwidth of the first L peers and the last N — L peers respectively, 
hence above inequality can be rewritten as Co < j^Cm + jjCn-m- 

For a given P2P system, multiplicity is defined as the largest M such that inequality (4) holds. 

It is worth noting that, the multiplicity here refers specifically to the M in the case that all 
peers are put in descending order by their upload bandwidth, as M can reach the maximum in 
this case. In this paper, the multiplicity we refer to is the largest integer M such that inequality 
(4) holds, where peers are put in random order in advance. The discussions in what follows will 
not put a limit on the order of peers either. 

Owing to space constraints, the strategies to achieve the above theoretical lower (upper) bounds 
in related literatures will not be given in this paper. 

2.3 Differentiated service model when peers have no initial data 

As section 2.2 shows, Tl has already been deduced, however, there is no unified form for Tl in 
the differentiated service model [14] and the Multiplicity Theorem [15]. In this section, we will 



reformulate Tl when there is no initial data before download. 
For convenience of expression, we define the multiplicity function: 

When M = N, F{M) is defined by jjziCm] when M = 1, F{M) becomes infinity [15]. 
According to the magnitude of L, there are two cases to be considered. 

1) L<M 

In this case, the last finish time of the first L peers is equivalent to the bottleneck time, i.e.: 

Tl = tb (6) 

2) L> M 

Since L > M, we can deduce F{L) < F{M). In this case, the first L peers can reach the 
maximum download rate, and T^ can be calculated as follows: 

F 

3 Extended equal service and differentiated service mod- 
els 

3.1 Data Distribution 

Researches in [12] [13] [14] [15] etc. all make an assumption that each peer has no data before 
download. In this paper, in order to study how the initial data will put an effect on the whole file 
dissemination process, we will consider the cases when the data distribution satisfies two special 
conditions. 

According to the previous section, in order to provide peers in LI with superior service, peer 
L + i in L2 has to download at certain rate /i < \cLj^i from the source, then replicates this data 
stream to all L peers in LI. In the case of L > M, when all L peers finish downloading in T^, the 
data amount peer L + i has is -^cl+j (refer to Fig.l). As a consequence, any peer in L2 has entirely 
different data proportional to its upload bandwidth from other peers in L2 after the download 
process. Obviously, the initial data that peers in L2 already have after the current download 
process will directly influence how these peers can finish their download in further download 
process. In the remaining sections, we call these data the forwarding data. Now we introduce the 
UP condition as follows: 

Unique and Proportional condition: ( UP condition) The data contained within the same peer 
set is a subset of the whole file, and it has the following two characteristics: 

• Each file piece must be owned by only one peer {unique data); 

• The ratio of the unique data amount to the upload bandwidth of each peer is equal. 




Fig. 1: Illustration of the UP condition 



Let Oj denote the data peer i owns before download, and the ratio of all peers' data amount to 
the file size F is denoted by ip (when ip is 0, it turns to the no-data case), i.e.: 



N 



f = ^ai/F 
1 

We can formally describe the UP condition as follows: 



(8) 



J a\ a2 flA^ 

°- '~ cx ~ C2 ~ ■■■ ~ cjv 


u 


Oi n Oj = 0, \/i^ j 




N 




[ja,CF 




L i=l 





(9) 



where ta is defined as the time interval that each peer can distribute its own unique data to other 
peers. According to Equation (9) and the definition of yj , we have v^ < 1. 

As a supplement to the UP condition, we also introduce the UCP condition as follows: 

Unique- Common and Proportional condition: ( UCP condition) The data contained within the 
same peer set is a subset of the whole file, and it has the following two characteristics: 

• Each file piece must be owned by only one peer {unique data) or be owned by all peers 
{common data); 

• The ratio of the unique data amount to the upload bandwidth of each peer is equal. 




(pF-x (l-(p)F 
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Fig. 2: Data distribution {UP condition and UCP condition) 



Assume the common data, the unique data and the data owned only by the source be x, ipF — x 
and (1 — ip)F respectively, as Fig. 2 shows. The UCP condition for a file F is equivalent to the 
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UP condition for a file F — x, hence, without loss of generality, we only consider the UP condition 
in our extended models. 

The data amount owned by all peers is denoted by Fa, and the data amount owned only by the 
source is denoted by Fq, then: 

TV 

Fa:=J2('i = vF',Fo:={l-v)F (10) 

1 

In order to download the whole file, the source has to upload data Fq to other peers. Let to be 
the bottleneck time, then: 

<o = ^ (11) 

to must be the tightly lower bound of Tl, therefore we have: 

Tl > to (12) 

If ta = to. If can be expressed as q^^, and this value is denoted by ipo for convenience, as 
equation (13) shows. Therefore v^ > v^o is equavalent to to < ta, and vice versa. 

3.2 Equal Service 

In this section, we will extend the single-set problem mentioned in [14] when peers' initial data 
distribution satisfies the UP condition, and calculate T^ according to the relationship between ip 
and ipo, there are two cases to be considered. 

1) V^ > V^O (^0 < ta) 

Without N peers' own upload bandwidth, it takes the source iVto to send Fq to all peers, and 
without the source's help, it takes all A^ peers (iV — l)ta to get Fa. According to the relationship 
of A^to and (A^ — l)ta, we have the following two cases. 

a) {N - l)ta > Nto 

In this case, the source should divide part of its bandwidth x to help peers downloading Fa, 
thus we have (refer to Fig. 3(a)): 

I? I? 

Tl (14) 



U^o-x) j^iU + x) 

When (/9 = 1, peers have the entire file F. The source peer can help every peer send its own 
data to other A^ — 1 peers, and T^ can be directly expressed as ^^~ ^ . 

b) {N - l)ta < Nto 



In this case, we have to < {N — l)ta < Nto- We can find that by taking some bandwidth x out 
of U to download Fq, it will take the same amount of time to download Fq and Fa (refer to Fig. 
3(b)), then we have: 



Fo 



Tr 



i(Co + x) ^(t/-x) ^^ 
Equations (14) and (15) have the same solution: 

_ {N- cp)F 
^ Co + U 



(15) 



(16) 



As all upload bandwidth can be utilized, Tl can also be viewed as the ratio of the total data 
required to transmit to the total upload bandwidth. Obviously, equation (16) still holds when 

ip = l. 

2) ip <ipo (to > ta) 

a) {N - l)t, > to 

In this case, we have iVto > (A^ — l)ta > to, part of U is used to exchange Fo, the bandwidth 
can also be allocated as Fig. 3(b) shows, and Tl can also be calculated by equation (16). 

b) {N - l)t, < to 

Let Un be the total upload bandwidth needed to distribute Fa to all peers, then the remaining 
upload bandwidth Ur,N is U — Un, i.e.: 



t/, 



Af 



(N - l)Fa 



tfl 



, Ur,N — U — Un 



(17) 



As peers must download Fo from the source, the download speed of Fo depends on the relation 
between Ur,N and Cq: 

i) If Ur^N > (A^ — l)Co, Ur^N IS large enough to ensure the download speed of Fo to be Co. As 
a result, the last finish time is constraint by the bottleneck time, i.e. T^ = to. 

ii) If Ur^N < {N — l)Co, Ur,N cauuot maximize the download speed of Fo. We can also use the 
bandwidth allocation strategy in equation (15) (refer to Fig. 3(b)), and Tl can also be calculated 
by equation (16). 





(a) (b) 

Fig. 3: Bandwidth allocation strategies {equal service) 



3.3 Differentiated Service 

In this section, the multi-set problem mentioned in [14] and the Multiplicity Theorem mentioned 
in [15] will be extended and generalized under the UP condition. 

1) V^ > V^o (^0 < to) 

li (f > (fo, no one can finish its download in to, i.e. M = 0, while in the case oi (p = (po, there 
is one and only one can complete in t^, i.e. M = 1. 

a) L = 1 

In this case, the source and all peers from the second peer upload to the first peer at full speed, 
the minimum last finish time can be calculated as: 



Tr 



F -ai 



Co + U-ci 



(1^ 



b) L> 1 

Denote the data amount of the first L peers and the last N — L peers by Fl and F^-l respec- 
tively. If Cn~l is fully used for transmitting F^^l to the first L peers, the time interval is Lta, 
then, in order to exchange Fl in Lt^, ^^Cl of their total upload bandwidth is used and j^Cl is 
left. 

Without the source's help, if all of jCl is still used to download Fa, suppose it takes the first L 
peers Tl to obtain Fa, we have ta < {L — l)ta < Tl < Lta, which means that all of j^Ci can be 
fully utilized. Suppose the first L peers divide part of their bandwidth x to help the last N — L 
peers downloading F;v_l (refer to Fig. 4), then we have: 



N-L 



Solving above equation, we have: 



T,iC 



N-L 



+ x) 



Tl' 



(19) 



Tr' 



LFa - Fl 
U 



(20) 



C 



The source does not upload 




1 

N-L 



c 



N-L peers 
Fig. 4: Bandwidth allocation strategy without the source's help {differentiated service) 

Without using \Cl, it takes the source at most Lt^ to send Fq to the first L peers. The most 
important question of the bandwidth allocation strategy is how \Cl should be allocated. 
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If the first L peers can fully exchange their own data Fl within T, the upload bandwidth 
required is denoted by Ul{L,T). Similarly, the upload bandwidth required to make the last 
N — L peers upload their own data Fjq^L to the first L peers within T is denoted by Un^l{L, T). 
Therefore we have: 

U,(L, T) = <^i>^, C/„_4i. T) = ^ (21) 

According to the relationship between Lto and T^', we have the following two cases, 
i) Lto < Tl' 

If (/9 is larger than some critical value, the source should split its upload bandwidth into three 
parts Cq — xl — xjy^i, xl and x^-l to help downloading Fq, Fl and -F/v-l respectively in the 
same time Tj,, as Fig. 5(a) shows. Then we have: 

° Tl (22) 



X (C'o — Xl — Xn_l) 

{Ul{L,Tl) - xl) + {Um-l{L.Tl) - xr,^L - C^-l) = Cl (23) 

Combining equation (22) and (23), we obtain: 

When ip = 1, peers have the entire file F, then to = 0. All of the source's upload bandwidth is 
used to transmit Fl and F^^l, then we have: 

XL + xn-l = Co, Fl + Fn.l = F (25) 

Combining equation (25) and (23), we get the same solution for T^, as equation (24) shows, 
ii) Lto > Tl' 

In this case, part of Cl should be used to download Fo, and we denote it by Ur^L,N-L{L,T), 
then: 

Ur,L,N-L{L, T) = U- Ul{L, T) - Un-l{L, T) (26) 

In order to make Fl-, Fn-l and Fq finished in the same time Tl-, the upload bandwidth can be 
allocated as Fig. 5(b) shows, and the following equation must hold: 

Tl (27) 



X [Co + Ur^L,N-L{L, Tl)] 

Solving above equation, Tl is also expressed as equation (24). 

Notice that, all above bandwidth allocation strategies in i) and ii) do not need the last N — L 
peers to forward any part of Fq from the source. 

2) ip <ipo (to > to) 
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Suppose L'ta < to < (L' + 1% {L' G [1, A^ - 1)) or (L' - 1% <to {L' = N-1), where L' is an 
integer. For convenience of expression, we use L'ta < ^o < {L' + l)ta {L' G [1, A^ — 1]) to indicate 
the above two cases in what follows, it has no impact on the correctness of the analysis. There 
are three cases to be considered according to the relationship between L and L' + 1. 

a) L = L' + 1 

In this case, T^ may be equal to, larger or smaller than tg. 

i) Tl' > to 

We define To as the minimum downloading time of Fq by using j;Cl, i.e.: 

Fo 
To := max(-r^ ^ r,to) (28) 

Case A: To > Tt^. In this case, as can be illustrated by Fig. 5(c), part of upload bandwidth of 
the last N — L peers should be used for forwarding To from source to the first L peers, which is 
denoted by Ur,N~L{L,T). Correspondingly, part of uplaod bandwidth of the first L peers should 
also be used for downloading To, which is denoted by Ur^L{L,T). Then we have: 

Ur,L{L,T) = Cl- UL{L,T),Ur,N-L{L,T) = Cn-l - Un.l{L,T) (29) 

In order to make Tq finished in Tl, we have: 



{ [Co + (1 - \;)Ur^N-dL,TL) + Ur,L{L,TL)\ 



(30) 



Notice that, Ur^N-L{L,TL) must be multiplied by a factor 1 — 1^, as the source must provide 
data jU^jq^^L^L^to) to the last N — L peers. This can be illustrated by Fig. 1. Most of all, the 
data distribution of the last N — L peers will satisfy the UCP condition as well after the first L 
peers finish their download. 

Solving equation (30), we get: 

Uo + f — jLyN-L 

Case B: To < Lta- In this case, j^Ci can be fully used for downloading Tq in a time interval 
smaller than Lta, then we can use the same bandwidth allocation strategy as Fig. 5(b) shows to 
make Tq and Fa finished in the same time Tl, and Tl can be also expressed as equation (24). 

ii) Tl' < to 

Case A: Tq > Lta- This case is similar to case A of i), and Tl can be calculated in the same 
way (refer to Fig. 4 (c)), as equation (31) shows. 

Case B: To < Lta- If to < {L' + l)ta, Fl and F^^l cannot be finished in to without using j^Cl- 
From equations (21) and (26), we can calculate Ur,L,N-LiL,to) = U — \ — ^^^^- In order to 
make the first L peers finish downloading To within to, we can allocate all upload bandwidth as 
Fig. 5(b) shows, and the following inequality must hold: 

Ur,L,N^L{L,to)>{L-l)Co (32) 

11 



Otherwise, the last finish time cannot reach to, and Tl can be expressed as equation (24). 

h) L<L' + 1 

In this case, we have Tl < L'ta < to- Fig. 5(c) illustrates the bandwidth allocation strategy. 
In order to make the first L peers finish download within to, the following inequality must hold: 



Co< 



Co + il- -)Ur,N-L{L,to) + Ur,LiL,to) 



(33) 



Substituting equation (29) into inequality (33), we can deduce the following inequality: 



Co < (1 - ^)F(L) 



(34) 



c) L> L' + 1 

In this case we have Tl > {L' + l)ta > to, therefore the discussions and equations to calculate 
Tl are the same as in case i) of a) . 




C 

I 

N -L peers 



.t^: 




Fl 



U,..,-ALJ) 



F., 



U„.,iL,T) 



L peers 



N — L peers 



(a) 



(b) 



(c) 



Fig. 5: Bandwidth allocation strategies {differentiated service). In (a) and (b), the last N — L peers 
need not to forward Fo from the source to the first L peers, while the opposite is true in (c). 

Theorem 1: Suppose L'ta < ^o < (L' + l)ta {V G [1, A^ — 1]), if inequality (34) is true when 
L = / + 1, where Z + 1 < L', it is also true when L < /; if inequality (32) is true when L = L' + 1, 
inequality (34) is true when L < L' . 

Proof: The proof of Theorem 1 is in the appendix. 

Based on the above conclusions, we can generalize the Multiplicity Theorem [15] to the following 
^p- Multiplicity Theorem. 

Theorem 2 {i.p -Multiplicity Theorem): For a given P2P system, if the initial data distribution 
satisfies the UCP condition and the initial data ratio is y?, it is possible to let the first M 
peers finish their download in the bottleneck time tg. We define the largest integer M as the 
ip- Multiplicity, and it can be calculated as follows: 

• If to < ta, no peer can finish its download in to, then M = 0. 

• If to = ta, there is one and only one peer can finish its download in to, then M = 1. 

• If L'ta <to< {L' + l)ta {L' G [1, A^ — 1]), there are two cases: If inequality (32) is true when 
L = L' + 1, we have M = L' + 1; otherwise, M equals to the largest integer L no more than 
L' such that inequahty (34) is true. 
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Fig. 6 illustrates the ip- Multiplicity Theorem in terms of the relation between to and ta (or the 
variation of ip). 



No peer can finish downloading in 
bottleneck time. 



In (i7„,(i'+lX^] , at most the first i'+l peers 
can finish their download in bottleneck time. 



(p = \ (p decreases with the increasing of ^j, ) (3 = 



Fig. 6: Illustration of the ip-Multiplicity Theorem 

Similar to section 2.3, we can calculate Tl as follows according to the ip- Multiplicity Theorem. 

1) L<M 

In this case, Tl must be constrained by the bottleneck time to , i.e. Tl = to. 

2) L> M 

a) If To < Lta, Tl is calculated by equation (24). 

b) If To > Lta, Tl is calculated by equation (31). 



4 Numerical results 

Suppose there are 18 peers with upload bandwidth 10, 10, 9, 9, 8, 8, 7, 7, 6, 6, 5, 5, 4, 4, 3, 3, 2, 2 
(KB) respectively and one source peer distributing a file with the size of 1000KB. Fig. 7 gives the 
numerical results of both the original equal service and differentiated service models (i.e. ip = 0) 
and our extended models (i.e. (p G (0, 1]). 

We examine T^ under ip and L in these models. Fig. 7 shows that, the magnitude of ip- 
Multiplicity decreases with the increasing of ip. For a given ip, T^ remains the same when L is 
smaller or equal to the ip- Multiplicity, and increases rapidly with the increasing of L when L is 
larger than the ip -Multiplicity. 

It is worthwhile to note that a smaller ip will seriously limit the utilization of peers' upload 
bandwidth, especially when L is small, hence will lead to a larger T^. This is because when ip 
is small, it will take the source peer a long time to distribute its unique data Tq to other peers. 
On the other hand, when L is large, all of peers' upload bandwidth will be better utilized, the 
increasing of ip will have slight influence on Tl. 



5 Conclusions 

In this paper, we modeled both the equal service process and the differentiated service process 
when peers' initial data distribution satisfies some special conditions (the UCP condition), and 
derived exact expression of the minimum last finish time. Both the UCP condition and the 
proposed models based on it have great theoretical and practical value. 

13 



Extended differentiated 

service model Differentiated service model 



^Extended equal 

service model 

A 



Equal service model 




Fig. 7: Equal service and differentiated service models, [ip G [0, 1]) 

The intrinsic relations among the initial data amount, the size of peer set and the minimum 
last finish time are revealed. Simply speaking, if the total data amount in all peers is small, 
the minimum last finish time is constrained by the source and cannot be further shortened 
by reducing the size of the first peer set. On the contrary, if the total data amount is large, 
the minimum last finish time can be much smaller than in the former case, and fewer peers 
can reach it. 

After the first peer set finishes its download, the data distribution of the second peer set still 
satisfies the UCP condition. Therefore, we can further use our proposed extended models 
to provide equal service or differentiated service to the second peer set. Furthermore, we 
can provide arbitrary degree of differentiated service to a certain number of peers. Suppose 
there are k sets of peers, then we can provide differentiated service from the first peer set 
to the k-th. peer set in turn, the whole download process can be decomposed into k — 1 
nested differentiated service processes and one equal service process, whichever process can 
be characterized by our extended differentiated service model or equal service model. 



Due to there are no limits for peers' upload bandwidth and the size of each peer set, 
proposed models have wide applicability and high extensibility for peer-to-peer file sharing. 
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Appendix 



In this appendix we proof Theorem 1. 
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Let Ur,L,N~L{,L,tQ) denote {l — j^)Ur,N-L{,L,tf)) + Ur,L{L,tf)) in inequality (33). Define a function 
ur{l) sucli tliat: 

ur{l) = Ur,i,N-i'{lM)/{l-l) (35) 

By simple calculation we have ur{l) = -^ ~ ^ ~ ia~i) ' ^'^^^^ 

M'.l)-«KO^%^-|ff<0 (36) 

If inequality (34) is true when L = / + 1, we have ur{l + 1) > Cq, therefore ur{l) > Cq holds, 
i.e. inequality (34) is true when L = I. Obviously, it is also true when L < I. 

Ur,L,N-LiL,to) can be rewritten as Ur,L,N-LiL,to) = U — ° — ^^-^ by definition. Since 

^0 < {L' + l)ta, we have: 

Ur,L'+l,N~{L'+l){L + l,to) < Ur,L'+l,N-(U+l) {L + l,to) (37) 

Combining (35) - (37), we have Ur,L'+i,N~(L'+i){.L' + l,to)/L' < Ur,L',N-L''{L',to)/{L' - 1 ). If 
inequality (32) is true when L = L' + 1, we have f/r,L'+i,Af-(L'+i)(-^' + ^,to)/L' > Cq. therefore 
Ur,L' ,N-L'' {L' ,to) / {L' — 1 ) > Co holds, i.e. inequality (34) is true when L = L' . Obviously, it is 
also true when L < L' . 
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